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^> ' Abstract 

Mh ' We study the effect of boundary conditions on the relaxation time (i.e., inverse 

t~i . spectral gap) of the Glauber dynamics for the hard-core model on the tree. The 

"jrt I hard-core model is defined on the set of independent sets weighted by a parameter 

A, called the activity or fugacity. The Glauber dynamics is the Markov chain that 
updates a randomly chosen vertex in each step. On the infinite tree with branching 
factor 6, the hard-core model can be equivalently defined as a broadcasting process 
with a parameter uj which is the positive solution toA = w(l-|-w)'', and vertices are 
occupied with probability a;/(l -I- w) when their parent is unoccupied. This broad- 
ly I casting process undergoes a phase transition between the so-called reconstruction and 
(NJ ■ non-reconstruction regions at ujr ~ In 6/6. Reconstruction has been of considerable 
CN ' interest recently since it appears to be intimately connected to the efficiency of local 
j~^ . algorithms on locally tree-like graphs, such as sparse random graphs. 
f^ ' In this paper we show that the relaxation time of the Glauber dynamics on regular 
^D . trees T^ of height h with branching factor b and n vertices, undergoes a phase tran- 
sition around the reconstruction threshold. In particular, we construct a boundary 
condition for which the relaxation time slows down at the reconstruction threshold. 
More precisely, for any a; < In 6/6, for Th with any boundary condition, the relaxation 
time is r2(n) and 0(n^+°''(^)). In contrast, above the reconstruction threshold we 
C^ ' show that for every S > 0, ioi ui — {1 + 6) In 6/6, the relaxation time on Th with any 
boundary condition is 0(n^+ +°''f^'), and we construct a boundary condition where 
the relaxation time is Q{n^~^^'^^°'''-^^). To prove this lower bound in the reconstruction 
region we introduce a general technique that transforms a reconstruction algorithm 
into a set with poor conductance. 
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1 Introduction 

There has been much recent interest in possible connections between equihbrium properties 
of statistical physics models and efficiency of local Markov chains for studying these models 
(see, e.g., [31 [ini EOj Ell ESI EO] ) . In this paper we study the hard-core model and establish 
new connections between the so-called reconstruction threshold in statistical physics with 
the convergence time of the single-site Markov chain known as the Glauber dynamics. The 
hard-core model was studied in statistical physics as model of a lattice gas (see, e.g., Sokal 
[29)). and in operations research as a model of communication network (see Kelly [15]). 
It is a natural combinatorial problem, corresponding to counting and randomly sampling 
weighted independent sets of an input graph G = {V^E). Let Q. = Q.{G) denote the set 
of independent sets of G. Each set is weighted by an activity (or fugacity) A > 0. For 
a & Q., its weight is Z{a) = X^"^ where \a\ is the number of vertices in the set a. The 
Gibbs measure is defined over il as ^[a) = Z{a)/Z where Z = X^g-eQ ■^i^) i^ the partition 
function. 

This paper studies the hard-core model on trees, in some cases with a boundary condi- 
tion. Let Th denote the complete tree of height h with branching factor h. For concreteness 
we are assuming the root has h children, but our results, of course, easily extend to allow 
6+1 children for the root, the so-called Bethe lattice. Let n denote the number of vertices 
in T/i, and let L denote the leaves of the tree. A boundary condition is an assignment F 
to the leaves, where in the case of the hard-core model, F specifies a subset of the leaves 
L that are in the independent set. Then let VLy- = {a E O : o"(L) = F} be the set of 
independent sets of Th that are consistent with F, and the Gibbs measure Hh,T is defined 
with respect to i^r, i-c, it is the projection of /U onto J^r- 

The (heat bath) Glauber dynamics is a discrete time Markov chain {Xt) for sampling 
from the Gibbs distribution /i for a given graph G = iy, E) and activity A. We view 
Jl C {0, 1} where for Xt G Q., Xt{v) = 1 iff u is in the independent set. The transitions 
Xt —7- XtJ^i of the Glauber dynamics are defined as: 

• Choose a vertex v uniformly at random; 

• For aW w ^ V set Xt+i{w) = Xt{w); 

• If all of the neighbors of v are unoccupied, set Xt-\-i{v) = 1 with probability A/(l+A), 
otherwise set Xt+i {v) = 0. 

When a boundary condition F is specified, the state space is restricted to Op. For the 
case of the complete tree T^ (possibly with a boundary condition F) it is straightforward 
to verify that the Glauber dynamics is ergodic with unique stationary distribution fih (or 
/i/j^r when a boundary condition is specified). Thus, the Glauber dynamics is a natural 
algorithmic process for sampling from the Gibbs distribution. We study the relaxation 
time of the dynamics, which is defined as the inverse of the spectral gap of the transition 
matrix. See Section El for a more detailed definition of the relaxation time. 

The Gibbs distribution describes the equilibrium state of the system, and the Glauber 
dynamics is a model of how the physical system reaches equilibrium [20]. Thus, it is 
interesting to understand connections between properties of the equilibrium state (i.e., 
the Gibbs distribution) and properties of how the system reaches equilibrium (i.e., the 
Glauber dynamics). Models from statistical physics are designed to study phase transitions 



in the equilibrium state. A phase transition is said to occur when a small change in the 
microscopic parameters of the system (in the case of the hard-core model that corresponds 
to A) causes a dramatic change in the macroscopic properties of the system. 

A well-studied phase transition is uniqueness/non-uniqueness of infinite volume Gibbs 
distributions, these are obtained as a limit of Gibbs measures for a sequence of boundary 
conditions as /i — )• oo. For the hard-core model on the complete tree, Kelly pLS] showed that 
the uniqueness threshold is at A„ = b /{h — 1) "'"^ (namely, uniqueness holds iff A < A^)- 
There are interesting connections between the uniqueness threshold A^ and the efficiency 
of algorithms on general graphs. In particular, Weitz [ST] showed a deterministic fully- 
polynomial approximation scheme to estimate the partition function for any graph with 
constant maximum degree h for activities A < A^. Recently, Sly \^&\ showed that it is NP- 
hard (unless NP = RP) to approximate the partition function for activities A satisfying 
\u < \ < \u + ^b for some small constant e^. 

We are interested in the phase transition for reconstruction/non-reconstruction. This 
corresponds to extremality of the infinite- volume measure obtained by the "free" boundary 
condition where free means no boundary condition [11]. This measure can be generated 
by the following broadcast process which constructs an independent set a. Let uj be the 
real positive solution of A = cj(l + ui) . Consider the infinite complete tree with branching 
factor 6, and construct a as follows. We first include the root r in o" with probability 
a;/(l + oo) and leave it out with probability 1/(1 + oo). Then for each vertex u, once the 
state of its parent p{v) is determined, if p{v) ^ a then we add v into a with probability 
a;/(l -l-o;) and leave it out with probability 1/(1 -l-a;); \ip{v) S a then we leave v out of a. 
Let (Th denote the configuration of a on level /i, and let Vh denote the broadcast measure 
on Th- 

Reconstruction addresses whether a^ "influences" the configuration at the root r. In 
words, we first generate a using the broadcasting measure, then we fix ah and resample a 
configuration r on Th from the Gibbs distribution /i/i r with boundary condition T = ah- Of 
course, for finite h, the configuration at the root r in r has a bias to the initial configuration 
a{r). Non-reconstruction is said to hold if the root is unbiased in expectation in the limit 
/i — 7- oo. More precisely, reconstruction holds if and only if: 



lim 'E^^uh 

h—^oo 



fJ-h,a,SreT^ 



1+UJ 



> 0. (1) 



There are many other equivalent conditions to the above definition of reconstruction, 
see Mossel [21] for a more extensive survey. We refer to the reconstruction threshold as 
the critical Ur such that for all uj < ujr non-reconstruction holds and for all lo > Ur re- 
construction holds. The existence of the reconstruction threshold follows from Mossel [251 
Proposition 20], and, by recent work of Bhatnagar et al [3] and Brightwell and Winkler [6], 
it is known that w.^ = (Infe -f- (1 + o(l)) lnln6)/6. 

Reconstruction for the Ising and Potts models has applications in phylogenetics [8] 
and for random constraint satisfaction problems is connected to the geometry of the space 
of solutions on sparse random graphs [H W2\ [TBI I23j . Our interest in this paper is on 
establishing more detailed connections between the reconstruction threshold and the re- 
laxation time of the Glauber dynamics for trees. Berger et al jiSj proved that for the tree 
Th with boundary condition T such that /z/^^r = ^h^ 0{n) relaxation time for all h implies 
non-reconstruction. For the Ising model and colorings the boundary condition is empty. 



i.e., Vh corresponds to the free boundary condition. That is not the case for the hard-core 
model as discussed further in Section [3l 

It was recently established for the Ising model [3l [2H [7] and for k-colorings [30j that on 
the tree T^ with free boundary condition, the relaxation is 0{n) in the non-reconstruction 
region and there is a slow down in the reconstruction region. Our starting point was 
addressing whether a similar phenomenon occurs in the hard-core model. Martinelli et al 
^22j showed that for the hard-core model on T^ with free boundary condition the relaxation 
time is 0{n) for all A (and the mixing time is 0(n log n)). However, it is unclear whether 
the reconstruction threshold has any connection to the relaxation time of the Glauber 
dynamics on trees for the hard-core model. (In fact, we vacillated between proving that 
there is fast mixing for all boundary conditions and proving the following result.) 

We prove there is a connection by constructing a boundary condition for which the 
relaxation time slows down at the reconstruction threshold. Here is the formal statement 
of our results. 

Theorem 1. For the Glauber dynamics on the hard-core model with activity A = a;(l-|-a;)* 
on the complete tree Th with n vertices, height h and branching factor b, the following hold: 

1. For all uj < In 6/6: 

^{n)< T,eiax <0(ni+"''«). 

2. For all (5 > and a; = (1 + J) In 6/6: 

(a) For every boundary condition, 

Trelax < ©(nl+^+^'-d)). 

(b) There exists a sequence of boundary conditions for all h ^ oo such that, 

r,elax = S^(nl+^/2-^(l)). 

Remark. More precisely, we show that there is a function g(b) = 0(lnln6/ln6) = o(l) 
such that for every 6, the lower bound in Part [2b] is il(n^~'^ '^~^' -*), and there is a function 
/(6) = 0((lnln6)^/ln6) = o(l) such that for every 6, the upper bound in Part [J is 
0(ni+^(^)) and in Part EH is 0{n^+^+f^^^). 

The upper bound improves upon Martinelli et al [22] who showed 0{n) relaxation 
time (and 0(n log n) mixing time) for A < l/(v6 — 1) for all boundary conditions. Note, 
A = l/v6 is roughly equivalent to uj ^ ^ In 6/6 which is below the reconstruction threshold. 
Our main result extends the fast mixing up to the reconstruction threshold, and shows the 
slow-down beyond the reconstruction threshold. Our lower bound in the reconstruction 
region uses a general approach that transforms an algorithm showing reconstruction into 
a set with poor conductance, which implies the lower bound on the relaxation time. This 
framework captures the proof approach used in |30j . 

In Section [2] we formally define various terms and present the basic tools used in our 
proofs. The lower bound (Part [2b] of Theorem [T]) is presented in Sections [S] [4] and [5] 
Section [3] outlines the approach. We then prove an analogue of Theorem [1] in Section [1] 
for the broadcasting model and use it in Section [5] to prove Part I2bl of Theorem [1] The 
argument for the upper bounds stated in Theorem [1] is presented in Section [6] 



2 Background 

Let P(-, •) denote the transition matrix of the Glauber dynamics. Let 71 > 72 > • • • > 7|q| 
be the eigenvalues of the transition matrix P. The spectral gap Cgap is defined as 1— 7 where 
7 = max{72, |7|n||} denotes the second largest eigenvalue in absolute value. The relaxation 
time Treiax of the Markov chain is then defined as c~ l,, the inverse of the spectral gap. 
Relaxation time is an important measure of the convergence rate of a Markov chain (see, 
e.g., Chapter 12 in [15]). 

To lower bound the relaxation time we analyze conductance. The conductance of a 
Markov chain with state space and transition matrix P is given by <!' = min5cr2{*^5}) 
where ^s is the conductance of a specific set S C $7 defined as 

^ ~ 7r{S)TT{S) 

Thus, a general way to find a good upper bound on the conductance is to find a 
set S such that the probability of "escaping" from S is relatively small. The well-known 
relationship between the relaxation time and the conductance was established in |17j 
and [26j . and we will use the form T^eiax = ^(1/*^) ^01 proving the lower bounds. 

3 Lower Bound Approach 

First note that the lower bound stated in Part [T] of Theorem [H namely, Treiax = ^{n), is 
trivial for all oj. For example, by considering the set S = {a £ il. : r ^ a} oi independent 
sets which do not contain the root, $(5) = 0(l/n) since we need to update r to leave S. 

We begin by explaining the high level idea of the non-trivial lower bound in Part 
I2bl of Theorem [ij To that end, we first analyze a variant of the hard-core model in 
which there are two different activities, the internal vertices have activity A and the leaves 
have activity uj. The resulting Gibbs distribution is identical to the measure i^h defined 
in Section [1] for the broadcasting process. Thus we refer to the following model as the 
broadcasting model. 

For the tree T^ = (V, E), we look at the following equivalent definition of the distribu- 
tion I'fi over the set Q of independent sets of T^- For o" G O, let 

where L are the leaves of T^ and uj is, as before, the positive solution to uj{l + uj)^ = 
A. Let Vh{a) = Z'{a)/Z' where Z' = Ylaen^^'i'^) ^^ ^he partition function. By simple 
calculations, the following proposition holds. 

Proposition 2. The measure Vh defined by the hard-core model with activity A for internal 
vertices and oj for leaves is identical to the measure defined by the broadcasting process. 

Proof. Li fact, we just need to verify that in the hard-core model with activity A for internal 
vertices and uj for leaves, the probability p^ of a vertex v being occupied conditioning on 
its parent is unoccupied is uj/{l + uj). This can be proved by induction. The base case is 



V being a leaf, which is obviously true by the Markovian property of the Gibbs measure. 
If V is not a leaf, by induction, the probability pv has to satisfy the following equation 

Py = (1 -pv)- 



{1 + ujf' 
which solves to p^ = uj /{I + oj). D 

The result of Berger et al [3] mentioned in Section [1] implies that the relaxation time 
of the Glauber dynamics on the broadcasting model is uj{n). We will prove a stronger 
result, analogous to the desired lower bound for Part [2b] of Theorem [TJ 

Theorem 3. For all 5 > 0, the Glauber dynamics for the broadcasting model on the 
complete tree Th with n vertices, branching factor b and w = {1 + 5)lnb/b satisfies the 
following: 

where the Ofe(l) function is 0(lnln6/ln6). 

Remark. We can show a similar upper bound on the relaxation time for the Glauber 
dynamics in this setting as in Theorem [H Moreover, we can show the same upper bound 
for the mixing time by establishing a tight bound between the inverse log-Sobolev constant 
and the relaxation time as was done for colorings in Tetali et al |30j . 

We will prove Theorem [3] via a general method that relates any reconstruction al- 
gorithm (or function) with the conductance of the Glauber dynamics. A reconstruction 
algorithm is a function A : il.{L) — t- {0, 1} (ideally efficiently computable) such that A{ah) 
and a{r) are positively correlated. Basically, the algorithm A takes the configurations 
at the leaves L as the input and tries to compute the configuration at the root. When 
the context is clear, we write A{a) instead of A^a^)- Under the Gibbs measure i^h, the 
effectiveness of A is the following measure of the covariance between the algorithm ^'s 
output and the marginal at the root of the actual measure: 

rh,A = min [i^hiM^^) = (^(.r) = x) - iyh{A{a) = x)i//,((T(r) = x)] . 
a;e|0,i} 

If it is the case that lim infh^oo fh,A = cq > for some positive constant cq depending 
only on u and b, then we say that it is an effective reconstruction algorithm. In words, 
an effective algorithm, is able to recover the spin at the root, from the information at the 
leaves, with a nontrivial success, when /i — )• oo. Notice that reconstruction (defined in ([T])) 
is a necessary condition for any reconstruction algorithm to be effective, since 



E,. 



'i^h 



l^h,au{reT) 



l + UJ 



> ^a^ui, [{f^h,ai, (rGr)- Vh (r E o))l{A{a) = 1)] > rh,A, 



where 1() is the indicator function. We define the sensitivity of A, for the configuration 
a G 0(T/i), as the fraction of vertices v such that switching the spin at u in u changes the 
final result of A. More precisely, let a^ be the configuration obtained from changing a 
at V. Define the sensitivity as: 

SA{a) = ^#{v G L : ^(a^) / A{a)}. 



The average sensitivity (with respect to the root being occupied) Sa is hence defined as 

SA = K^uJSA{a)l{A{a) = l)]. 

It is fine to define the average sensitivity without the indicator function, which only affects 
a constant factor in the analysis. We are doing so to simplify some of the results' statements 
and proofs. 

Typically when one proves reconstruction, it is done by presenting an effective recon- 
struction algorithm. Using the following theorem, by further analyzing the sensitivity of 
the reconstruction algorithm, one obtains a lower bound on the relaxation time or mixing 
time of the Glauber dynamics. 

Theorem 4. Suppose that A is an effective reconstruction algorithm. Then, the relaxation 
time Tfciax of the Glauber dynamics satisfies Tj-eiax = ^ {{Sa)~^)- 

Remark. The above theorem can be generalized to any spin system. To illustrate the 
usefulness of this theorem, we note that the lower bound on the mixing time of the Glauber 
dynamics for fc-colorings in the reconstruction region proved in [30] fits this conceptually 
appealing framework. 

Proof. Throughout the proof let i' := i^h- Consider the set U = {a : A{a) = 1}. Then, 

5>r/ = TTTTTT 7^ZZ-s < 



u{U){l-u{U)) - v{U){l-u{U)) 



From the definition of r/i^^, we have that ^{U) > v{A{a) = (y{r) = 1) > r^^A^ and similarly 
(1 — ^{U)) > ^{Aia) = a{r) = 0) > r^^A- Now, because the algorithm is effective, we 
have liminf/j_^oo(^/i,A) = cq > and hence for all h big enough, r^^A > 0. Therefore, 
^U < ('^/i,a)~^5*a, which concludes that Treiax = Cg^p > 1/^u = ^{{SAy^)- □ 

To prove Theorem [3l we analyze the sensitivity of the reconstruction algorithm by 
Brightwell and Winkler [6, Section 5] which yields the best known upper bounds on the 
reconstruction threshold. Our goal is to show that the average sensitivity of this algorithm 
is small. The analysis of the sensitivity of the Brightwell- Winkler (BW) algorithm, which 
then proves Theorem [3l is presented in Section HI 

Our main objective remains of constructing a sequence of "bad" boundary conditions 
under which the Glauber dynamics for the hard-core model slows down in the reconstruc- 
tion region. An initial approach is that if we can find a complete tree T' with some bound- 
ary condition such that the marginal of the root being occupied exactly equals a;/(l + uj), 
then by attaching the same tree T' with the corresponding boundary conditions to all of 
the leaves of a complete tree T, we are able to simulate the nonuniform hard-core model 
on T, (i.e., the resulting measure projected onto T is the same as the one in the broad- 
casting model) and hence we can do the same approach to upper-bound the conductance 
of the dynamics on this new tree. However, from a cardinality argument, not for every cj 
there exists a complete tree of finite height with some boundary condition such that the 
marginal probability of the root being occupied equals uj/{l + oj). Alternatively, we give 
a constructive way to find boundary conditions that approximate the desired marginal 
probability relatively accurately. This is done in Section [5l 



Finally, at the end of Section [5] we argue that since the error is shrinking very fast from 
the bottom level under our construction of boundary conditions, we can again analyze the 
sensitivity of the Brightwell- Winkler algorithm starting from just a few levels above the 
leaves. This approach yields the lower bound stated in Part [2b] of Theorem [H 

4 Lower Bound for Broadcasting: Proof of Theorem [3] 

Throughout this section we work on the broadcasting model. To prove Theorem [3] we 
analyze the average sensitivity of the reconstruction algorithm used by Brightwell and 
Winkler [6], which we refer to as the BW algorithm. For any configuration a as the 
input, the algorithm works in a bottom up manner labeling each vertex from the leaves: 
a parent is labeled to occupied if all of its children are labeled to unoccupied; otherwise, 
it is labeled to unoccupied. The algorithm will output the labeling of the root as the final 
result. Formally, it can be described by the following deterministic recursion deciding the 
labeling of every vertex: 

R (v) = I ^^^^ if V G L 

^ \ I — max{Rcj{wi), Ra{w2), ■ ■ ■ , Raiwb)} otherwise 

where wi, . . . ,Wb are the children of v. Finally, let BW((t) = BW(o-/j) = Ra{r). Note that, 
BW((t) only depends on the configuration ah on the leaves. The algorithm is proved to 
be effective in [6] when 5 > 0. Therefore, it can be used in our case to lower bound the 
relaxation time. In this algorithm, by definition we have 

^BW = 0(n-iE,^^, [#{7; G L : BW(a) = 1 and BW(a'') = 0}]), (2) 

Due to the symmetry of the function Ra{v) and the measure f/j, the expectation can be 
further simplified as 

E^ [#{?; G L : BW((t) = 1 and BWia") = 0}] = b''uh{BW{a) = 1 and BW(ct") = 0), (3) 

where u is now a fixed leaf. To bound the right hand side of Eq.Q, let k G ^(7^) be 
a fixed configuration such that BW(k) = 1. Let the path V from u to the root r be 
uq = u -^ ui -^ ■ ■ ■ -^ Uh = r, and for any i > 0, let Wij be the children of Ui so that 
the labeling is such that for j = 1, Wi^i = Ui-i and for j 7^ 1, Wij is not on the path V. 
An important observation is that, in order to make BW(k) change to by changing only 
the configuration at u of k, a necessary condition for k is Rniui) = 1 — Rniui-i) for all 
i > 1. Then for all i > 1 and j G {2, ... ,6}, we have R^iwij) = 0. To calculate the 
probability that a random n ^ u^ satisfies such conditions, it would be easier if we expose 
the configurations along the path V. Since then, conditioning on the configurations on 
the path, the events R^iwi^j) = are independent for all i,j. And if n{ui) = 0, we have 
for all j > 1, the conditional probability of R^iwij) = equals Pr^^,y._j [BW(r/) = 0], the 
probability BW algorithm outputs a over a random configuration 7] of the leaves of the 
complete tree Tj_i with height i — 1. The analysis above leads to the following lemma, 
which bounds the probability Uh{BW{a) = 1 and BW((t") = 0). 



Lemma 5. For every i > 0, let rj ^ Q(Ti^i) be a configuration chosen randomly according 
to measure Ui-i, then 



Vh{BW{a) = 1 and BW{a^) = 0) < E«. 



'^h 



i>0:K(ui)=0 



Complete proofs of lemmas in this section are deferred to Section [4.11 To use Lemma [5l 
we derive the following uniform bound on the probability Pr^r^^,. [BW(r/) = 0], for all i. 
Here and through out the paper, 6o(<^) is a function explicitly defined in Lemma [8] in 
Section [4.21 This function is of order exp((5~^ ln(5~^)) as (5 — t- and remains bounded as 

(5 — )■ oo. 

Lemma 6. Let 5 > 0, and let u = (1 + 5) In 6/6. For all b > bQ{5) and i > 1, 



Pr^.,, [BWiv) = 0] < 



l + OJ 



Combining Equations ([2]), ([3]), Lemma [5] and Lemma [U we are able to upper bound 
the average sensitivity of the BW algorithm: 



5bw = 0{iyh(BW{a) = 1 and BW{a^) = 0)) = O E«. 



■i^h 



lMuj{l + uj) 



#{i:kK)=0}' 



In this expectation, the number of unoccupied vertices in the path V can be trivially 
lower bounded by h/2, since it is impossible that there exists i > 0, K{ui) = K{ui-i) = 1. 
Therefore, the above expectation can be easily bounded by 0*(n~(^+ ''^). This is not 
good enough in our case. We sharpen the bound using Lemma [9] in Section 14.21 leading 
to the following theorem, whose complete proof is contained in Section 14.11 

Theorem 7. Let 6 > 0, and let uj = {I + 6) In 6/6. For all b > 6o((5), 

ln(A/(L01a;6)2)\ 



Tl-eiax = ^{n ), where d = il + 



2 In 6 



Theorem E] is a simple corollary of Theorem [7] by noticing that d = 1 + 6/2 — O ( '?'! ) . 
Furthermore, we can "hide" the fact that 6 > 6o(5) in this residual term, by using the 
trivial lower bound Q{n) for all 6 < bo{d) and bo{d) ^ exp{6~^ ln{6~^)) as 5 — )• 0. 

4.1 Proofs. 

Note that throughout this paper, we will use the following notations for the relationships 
between two functions f{x) and g{x) for simplicity. If lima;_>.oo f{x)/g{x) = 1, we write 
f{x) ss g{x); if /(x) = 0{g{x)), we write f{x) < g{x) and if f{x) = Q{g{x))^ we write 
f{x) >g{x). 

Proof of Lemma\^ Let x = {0, 1} be a valid configuration on the path V. Conditioning 
on K{ui) = x(i) for all i, we know that the events R^iwij) = are independent for all i and 
j. Given «;(uj) = 0, the probability of the event Rf^{wij) = equals Pr^^^--^ [BW(r/) = 0]; 



and given «:(mj) = 1, the probability of this event can be trivially upper bounded by 1 
(this bound is "safe", in the sense that the actual quantity is close to 1 for big A). By 
this, we can conclude that 

UhiBWiK) = 1 and BW(k") = 0) 

< Y, M^^ ■■ V^, n{u,) = x(u,)) J] Pr„^.,_i [BW(r?) = of-^^ 



E. 



i>0:K.{ui)=0 

n Pr,..,_,[BW(^) = 0](^-^) 

i>0:re(ui)=0 



n 



2(1.01)(ajfe)^ 



Proof of LemmalSi In the proof, we will use the fact that exp ( ^'-^■'^^a'^"; \ < i.oi, when- 
ever b > bQ{6) (Lemma [8|) . Now, for simplicity, denote fi = Pr^^i,._j [A{i]) = 0]. First of 
all, notice the recurrences 

/i = :r- — ) /2 



1 



1 



l+OJ 

1 



l+LO 






l+UJ 



The result follows by an easy induction: For h = 1,2, the result is clear. On the other 



hand, from the previous recurrences, if it is the case that fi < 



(1.01)1/'' 

l+LU 



then 



fLi< 



UJ 



< 



l + UJ 



l+UJ 



l-/tl 



+ 



1 



n b 



1+LJ 



LOlw 
A 



+ 



l+UJ 



^ ^ + ^^ Y^ exp(1.01(a;b)VA) ^ 1-01 



A_ 

l+UJ 



I+OJ) 



{1+ujY 



where the third inequality follows from the fact that (1 — n) > 1 — ub for u < 1, the 
fourth inequality follows from {1 + u) < e", and the last inequality follows from the fact 



that exp 



2(1.01)(a;fe)^ 
A 



< 1.01 for 6> boi5). 



D 



Proof of Theorem [^ From Lemma [9] in Section 14.21 we have that 



E 



1.01^(1 +a;l 



A 



#{i:a{u,)=Oy 



1 + 



1-e 



2e(l + w) 



(1 + e) flMu) 

~2>r 



1 + V1 + 4A/1.01 



where e = 
Therefore, 



y^l + 4A/1.01 . The previous term is asymptotically dominated by ( 1.01^4y2 

1+-^ ^TT-r '- 



Sa = 



LOlw 



O n 



Now, from [6l Section 5], it is known that the BW algorithm is effective for u > 
(1 + (5) In 6/6 and 6 > bo{6). (This can also be deduced from Lemma [6] by following the 
same steps as in Proposition [T6]) . therefore Theorem H] applies. The conclusion follows. D 

4.2 Some Technical Lemmas. 

Lemma 8. Define bo{6) = min{6o : exp ('HiiMMi^ < i.oi for all 6 > 6o}. Then bo{5) 
is a continuous function such that 

1. bo{S) < oo for all 6 > (that is, it is well defined). 

2. bo{6) ^ exp ((1 + o{l))6-^ H^'^)) as 6^0. 

3. 6o((5) ~ 6o(oo) as 5 —7- oo, where 6o(oo) is a fixed constant < 2. 
Proof. For (1), just notice that for 5 > fixed, we have that, as 6—7-00, 

'2(1.01)(a;6)2' 



exp 



1 < l.OL 



For (2), notice that if we let 6 = exp (/35 ^ ln{6 ^)), then as 5 — t- 0, we have that 

00 as (5 — )• 0, while if /? > 1 the 



exp 



Therefore, if /3 < 1, it is the case that exp ( ^ ' ^'^ ' 
same expression goes to 1. 



For (3), notice that for 6 fixed, exp 



2(1.01)(i^6)" 
A 



exp 



0(1) 



as (5 — )• 00. 



D 



Lemma 9. Let ^0;Ci)--- be a Markov process with state space {0,1}, such that (^0 
and with transition rates po^o = p, Po->i = Q, Pi^o = 1; Pi^i = 0. Let N^ 
#{1 < i < /i : Ci = 0}, then 



E[, 



,Nh] 



p{l - e)\ (1 + e) fpa 
2^ 



1 + Vl + 4g/ (ap2) 



where e = , ^ . Moreover, if the transition rate po-^o is inhomogeneous but such 

y/l+4,q/{ap^) 



P - Po-^0 



that 

eK'']<(i + 

where e 



< 6, then 

ip + 6){l-e )\ (1 + e) ( ip + 6)a 
2e 



l + Jl + A{q + 6)/(a{p + 6) 



^l+4{q+S)/{a{p+5f) ' 
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Proof. Straightforward combinatorics leads to the expression 

F r^^hl - Y^LV2J (h~k\h-2kkh-k j_ s-^[(h+l)/2\ (h-k\h-2k+lkh-k 

Now, for the first term, we have that 

where x = -^, which by standard saddle point methods, noticing that the function 
(j)(t) = lim h~ In { 7h )x reaches its maximum at the point t* = 2(1 — e); where 



e = 1/V1 + 4X and (j)"{t*) = Y'7(T3i)(i+^, implying that 

1 ^[h/2] (h-k\h-2kkh-k _ (1 + g) /^ l + \/l + 4x 



{pa) 
Similarly, 



1 s:^[{h+i)/2\rh-k\ h-2k+i k h-k ^, P(l - g) (1 + e) /^ l + Vl + 4g/(ap2) 

;2^fc = l \k-l)P 



^pa^h^k=i \k-v^ ^ - 2e 2 \^ 2 y ' 

from where the result follows. In the inhomogeneous case, we have that 

from where the result follows using the same asymptotic. D 



5 "Bad" Boundary Conditions: Proof of Theorem [Tll2b 



First, we will show that for any to, there exists a sequence of boundary conditions, denoted 
as Fij := {rj}j>o, one for each complete tree of height i > 0, such that if z — )• 00, the 
probability of the root being occupied converges to j^- Later in this section we will 
exploit such a construction to attain in full the conclusion of Part [2b] of Theorem [TJ 

As a first observation, note that, the Gibbs measure for the hard-core model on Tj with 
boundary condition F is the same as the Gibbs measure for the hard-core model (with the 
same activity A) on the tree T obtained from Tj by deleting all of the leaves as well as 
the parent of each (occupied) leaf v €T. It will be convenient to work directly with such 
"trimmed" trees, rather than the complete tree with boundary condition. Having this in 
mind, our construction will be inductive in the following way. We will define a sequence of 
(trimmed) trees {{Li, Ui)}i>o such that Li^i is comprised of Sj+i copies of Ui and b — Sj+i 
copies of Li with {sj}i>i properly chosen. Similarly, f/j+i is comprised of tj+i copies of Ui 
and b — tj+i copies of Ui, with {ti}i>i properly chosen. 

We will show that, for either T* = Li, or T^* = U, it is the case that the 'Q'-value, 

defined as: 

ut* (c(?") = 1) 

ujfiT* {(T{r) = 0) 
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where /xt*(-) is the hard-core measure on the trimmed tree T* , satisfies Q{T*) — ;• 1. Note 
that if Q{T*) = 1, then the probabihty of the root being occupied is oj/{\ +uj) as desired. 
To attain this, we wiU construct Li and Ui in such a way that Q{Ui) > 1 and Q{Li) < 1. 
The recursion for Q(-^i+i) can be derived easily as 



QiU, 



(l+a;) 



il + ojQiU^)y^+^ia + u;Q{Li)) 



6-Si-i 



and a similar equation holds for Q{Ui+i) by replacing Sj+i with tj+i. 

To keep the construction simple, we inductively define the appropriate U and Si, so 
that once Li and Ui are given, we let tj+i be the minimum choice so that the resulting 
Q- value is > 1, more precisely, we let: 

t.,, = argn.in{Q = (i + Q(^^)^.(t + 1q(L.))-^ ^ ^ ^ ^^- 

And similarly, we let: ..+i = argmaxjQ = (i + q(^^)).(i + q(^^))5-. ^ ^ < !}• 

The recursion starts with Ui being the graph of a single node and Li being the empty 
set, so that Q{Ui) = X/co and Q{Li) = 0. Observe that, by definition, Sj+i G {tj+i,tj+i-|- 
1} and that the construction guarantees that the values Q{Li) are at most 1, and the 
values Q{Ui) are at least 1. The following simple lemma justifies the correctness of our 
construction. 

Lemma 10. 

lim Q{U{)/Q{Li) = 1. 

Proof. It is easy to see that either ti = Si (meaning that Q{L,i) = Q{Ui) = 1), or ij = Sj — 1, 
which implies that 

QjUi) ^ l + ojQ{U,-i) Q{U-i) 

Q{Li) l + ujQ{Li^i) Q(L,_i)- 
So the ratio is shrinking. Suppose the limit is not 1 but some value q> 1. Then, 

Q([/,_i) Q{Ui) ^ Q([/,-i) - Q(L„i) 
Q(L,_i) Q{Li) (l + ^Q(L,_i))Q(L,_i)- 

Since Q{Ui)/Q{Li) > q and Q{Li) < 1, we have 

QiUi^i) - Q{Li^i) ^ {q- l)Q{L^-l) _q-l 



(1 + ioQ{Li^i))Q{Li^i) - g(L^-i)(l +uj) 1 + a; ' 

which is a constant. 

Therefore as long as g > 1, we show that the difference between the ratios for each 
step i is at least some constant which is impossible. Hence the assumption is false. D 

By this lemma, it is easy to check that if we let T* to be equal to either Ui or Li, then 
Q{T*) — 7- 1. Indeed, we can show that the additive error decreases exponentially fast. The 
following lemma indicates that this is the case for a; < 1 (although a similar result holds 
for any oj). 
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Lemma 11. Let ef be the value of Q{Ui) — 1 and let e^ he the value of 1 — Q{Li), then 

4+1 + ^i+i <w(e+ + e-). 
Proof. We can rewrite the expression 

(1 + u;)V(l + u:Q{Ui)y{l + u:Q{U)f-^ 

as 

1 



(l + Tf^^.+)^'(l-TT^er)^-r 



Now, let k be the biggest index over \b] such that the denominator of the previous expres- 
sion is less than 1 (thus, fc + 1 will be the least index such that the denominator is greater 
than 1). Then, 



1 1 



g-^ + g- = 

i+l i+l I-, I u ,+ \k(-\ _ <^ r~\b-k (-1 _|_ _iii_c+\k+l('] _ _!£_,- \6-fc-l 

1^(4 + ^7) 



< — - — ^— — z — (by the property of /c + 1) 

< ^(4 + ^7)- 

D 

Coming back to the original tree-boundary notation, let T\ be the boundary corre- 
sponding to the trimming of the tree U^ and let r| be the boundary corresponding to 
the trimming of the tree Lh- By our construction, for any vertex v on the tree of height 
h, the measure from fJ-hri (or ^/jr2) projected onto the space of the independent sets of 
the subtree rooted at v with the boundary condition corresponding to the correct part of 
r and the parent of v being unoccupied is either fj,^ pi or fi^ p2 , where i is the distance 
of V away from the leaves on Th- Conditioning on the parent of v being unoccupied, in 
the broadcast process defined in the Introduction, we would occupy v with probability 
CO /{I + uj). Therefore, in the above construction, the probability v is occupied (or rather 
unoccupied) is close to the desired probability, and the error will decay exponentially fast 
with the distance from the leaves. This is formally stated in the following corollary of 
Lemma [TTl 

Corollary 12. Given any a; < 1 and the complete tree of height i, forV equal to T^ orVf 
inductively constructed above, we have 



fj.i,r{cr{r) = 0) 



1 + w 



< uj'-'X/b. 



Throughout the rest of this section it is assumed that we are dealing with the boundary 
conditions {F^j/igN and {Tf^}heN constructed above. We will then show that for every 
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a; = (1 + (5) In 6/6 under these two boundary conditions, the Glauber dynamics on the 
hard-core model slows down, whenever 6 > 0. As we know from Corollary I12( the error of 
the marginal goes down very fast, so that roughly we can think of the marginal distribution 
of the configurations on the tree from the root to the vertices a few levels above the leaves 
as being close to the broadcasting measure. In fact, by following the same proof outline 
as we did in Section HI we are able to prove the same lower bound in the hard-core model 
for these boundaries. To do that we need a slight generalization of the reconstruction 
algorithm and extensions of the corresponding lemmas used in that section to handle the 
errors in the marginal probabilities. 

To generalize the notion of a reconstruction algorithm to the case of a boundary condi- 
tion we need to add an extra parameter i depending only on uj and 6. We will essentially 
ignore the bottom i levels in the analysis, and we will use that for the top h — i levels the 
marginal probabilities are close to those on the broadcasting tree. We define a reconstruc- 
tion algorithm with a parameter I for the tree T^ with boundary condition F as a function 
Ai : Vi{Lh-i) — )• {0, 1}. The algorithm A^ takes the configurations of the vertices at height 
h — i as the input and tries to compute the configuration at the root. For any a G Q(Th^r), 
the sensitivity is defined as: 5^_a('7) = ;^# {v G L^-e '■ ^e{(^h-i) t^ ^ei^^h-e)}- The aver- 
age sensitivity of the algorithm at height h — i with respect to the boundary T is defined 
as: Sjj^ = Eo- [Si^A{<^)'^{Ae{(^h-e) = !)]• And the effectiveness is defined as: 

r^A = min [fJ'h,T{A{(^h-e) = x and a{r) = x) - l^h,riAe{ah-e) = x)fih,Ticr{r) = x)]. 
xe{o,i} 

We can show the analog of Theorem [13] in this setting. 

Theorem 13. Suppose that Ag is an effective reconstruction algorithm. Then, it is the 
case that the spectral gap Cgap of the Glauber dynamics for the hard-core model on the tree 
of height h with boundary condition T, satisfies Cgap = 0{Sj ^), and hence the relaxation 
time of this Glauber dynamics satisfies Trdax = ^(l/'S'f^)- 

To bound the average sensitivity for the boundary conditions Tj^ and r| constructed 
above, we again use the same BW algorithm as we analyzed for the broadcasting tree. As 
in Eq.(l2|) and ([3]), it is again enough to bound the probability 

PIbw ■■= l^h,T,i^We{ah-i) = 1 and BW,K_,) = 0) 

for a fixed vertex -u at a distance i from the leaves, although in this case, this probability 
will not be the same for all u. Let the path V from u to the root r he uq = u -^ ui -^ 
. . . -w Uh-£ = r, and for each i > and j £ {1, . . . , 6}, let Wij be defined similarly as in 
Section HI Further, let Tij be the boundary condition F/j restricted to the subtree T^. . 
of Th rooted at the vertex Wij. These subtrees are of height i -\- i — 1 for each i. Note 
that, by our construction of the boundary conditions, for each fixed i, Fjj = F|^^_^ or 
Tij = F^_(_^_j^. The probability -P/'gw ^^^^ ^^ calculated by the following lemma, which is 
the analog of Lemma [5] for the broadcasting tree. 



Lemma 14. 



Pe,BW — -^o 



J>0:crK)=0i=2 
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where the expectation is over the measure /^^.r^; and for each i,j, r] is a random configu- 
ration on the subtree rooted at Wij with the probability measure /ij4.£_i.ri ■ • 

The proofs of Theorem 1 1 3 1 and Lemma [T^ use the same proof approach as for Theorem 
Hland Lemma [5] respectively. However, to bound Pr^ [-Ae{r]) = 0] for every i > 0, in spite 
of going along the lines of Lemma [6l the proof does require extra care to deal with the 
errors in the marginal probabilities which were bounded in Corollary [T2l In particular, 
we will establish the following lemma to upper bound Pr^^^^^^ p_ [Ae{rj) = 0] for each 
i > 0. Here and throughout the text, we define i{X, b) to be the minimum i such that for 
all i > £, 

, , , , 1 / 1.01(^6)2 



The existence of such constant i{X, b) is guaranteed by Lemma [TOl also from Corollary [12] 
we can deduce a explicit value for i(X,b), provided that u < 1. 

Lemma 15. Given any S > 0, and i > i{X,b) = i, then both Pr^^^_ ^ i-^iiv) — 0] ^'^^ 

■ i 

Pr^^^_ a [^(:{v) = 0] '^'"^ upper bounded by -"""^^ for any b > bo{6). 

' i 

And also it is not hard to show that the BW algorithm under this setting is effective. 

Proposition 16. The BW reconstruction algorithm is effective to recover the configuration 
at the root from the configurations at distance i{X, b) from the leaves. 

Then, we are able to again bound S'Jg^v ^o'^ ^ ~ ^\ ^''^ ^\^ proving the following 
theorem, which completes the proof of Part I2bl in Theorem [TJ Interested readers can look 
up Section ISTTl for the complete proofs. 

Theorem 17. Let 5 > Q, and let w = (1 + 5) In b/b. For all b > 6o((5), it is the case that 

Trelax = f^K), wherC d = I 1+ ^ ' )^^^^ ' ' \. 

5.1 Proofs. 

Proof of Lemma\J5[ Let fi^i denote Pr^^^ ^ [Ai{r]) = 0] and let fi^2 denote Pr^^^ ^ [^^(^) = 0]- 

Let ti = b — ti and Si = b — Si for simplicity. Now, recall the recurrences given in Lemma 
El which in this case take the form 

,2 



/.+i,i = Pr,.^^^, Hr) = 1] ^1 - ^1 - ft,;,fti;2) (l " f!-ij!-i 
+ Pr,.,^^^Jr?(r)=0](l-/J^^/f-^ 



/.+i,2 = Pr,.^,,,. Hr) = 1] ^1 - ^1 - ft,Jtia) (1 - /^i,i/^i,2 
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And the base cases are: 



v^/j.. 



fe+1,1 
fe+1,2 



Pr 



r^r^/i 



Pr 



n^/j; 



Kr) = 0] (i - /;;+7'r 



£+i,r 



LOii/*- 



Our purpose now, is to show by induction that /j^i, /j^2 < '1^^ for all i > L The base 
case is simple: 



/£+i,i < /«,! < Prr,-/.^ 1 [r/(r) = 0] < — — exp ( 



im{ojhf 

A 



l.Oli/'' 



and the last term is less or equal to ^^j^ — for b > bo{S) [Lemma [5]. Similarly, it is the 

case that fe+1^2 < fe,2 < ^Tf^- Now, in general, assuming the inductive hypothesis, we 
get from the above recurrence, that 



/j+1,1 



< 



w 



l + w 



1-1 



l.Olo; 
A 



+ 



1 



l+CJ 



exp 



1.01(a;6)^ 
A 



where the first term, just as deducted in the proof of Lemma[6l is dominated by ^ n4- l^ ' 
Now, the hypothesis follows for /i+1,1 (and seemingly for /i+1,2); due to the fact that 
exp(2(1.01)(a;6)^/A) < 1.01 for b > bo{5), proving the induction. D 

Proof of Proposition\l(A Notice the formulas, 



//^^ri {A{a) = : a{T) = 0) = 1 - /x,_i^ri (.4(a) = 0) /i^.^^r^ {A{a) = 0) 

' h ' h~l ' h—1 



/^h,rl(^(^) = 1:^(0 = 1) 



1 - (/^h-2,ri SM<^) = 0) )*'"' (/^i^-2,r? (^(^) = 0)"*"^' 



ith 



l-(/^h-2,ri .(^(^) = 0))''"7^,, or? (^(ct) = 0)'''" 



-\th 



whith similar expressions for fi^ r2 (•). Now, from these recurrences and the bounds stated 

' h 

in Lemma fl5| we deduce that 



/., ri iA{a) = 0, a{r) = 0) - ^, pi {A{a) = 0)^. ri ('^(O = 0) 



^ 1 



1.01 1.01^/^ 



{i + uY 



l+UJ 



>0 



With the same result holding for /i^ p2 



D 
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Proof of Theorem 11. It goes along the lines of the proof of Theorem [71 Here, we take (. 
as ^(A, 6), as in Lemma [TSl Now, due to Lemma [T4l we have that 



imuj{i + uj] 



A 



#{i:aK)=0}' 



^e.,BW — O [ E 
and using the second statement of Lemma [H we have that 



E 



A 



< 



l.Olo; exp ( ^j-^^ 



2A 



1 + 



1 + 



4A 



\ LOlexpl'i 



X 



( \^~^ 

where the previous term is dominated by I 1.01^-y72 ) • And just as in the proof of 
Theorem [TJ 



Sa = 



LOlw 



Ai/2 



O n 



In ("A/fl. 011^6)2) 
IH ^^ - 



Now, from Propositionll6t the BW algorithm is effective for w > (1+5) In 6/6, 6 > 6o(5), 
therefore Theorem 1131 applies. The conclusion follows. D 



h-n 



6 Upper Bounds of the Relaxation Time via the Couphng Method 

We will use the coupling technique in some of our proofs which upper bound the relaxation 
time and the mixing time. The mixing time Tmix for the Glauber dynamics is defined as 
the number of steps, from the worst initial state, to reach within variation distance < l/2e 
of the stationary distribution. It is an elementary fact that the mixing time gives a good 
upper bound on the relaxation time (see, e.g., [18] for the following bound), we will use 
this fact in our upper bound proofs: 



^ relax 



< Tniix + 1. 



(4) 



Given two copies {Xt) and (Yj) of the Glauber dynamics, a coupling is a joint process 
(X(,l^) such that the evolution of each component viewed in isolation is identical to the 
Glauber dynamics, see [18] for more background on the coupling technique. The Coupling 
Lemma [2j (c.f., [181 Theorem 5.2]) guarantees that if, there is a coupling and time t > 0, 
so that for every pair [X^^Yq) of initial states, Vv\Xt / Yt \ Xo,yo] < l/2e under the 
coupling, then Tmix < i- 

Before we show the main idea of proving the upper bound, let us introduction some 
notations for this section first. Let Tp be the relaxation time of the following Glauber 
dynamics of the hard-core model on the star graph G, where p is a 6 dimensional vector. 
The dynamics is defined as: if the root is chosen, it will be occupied with probability 
A/(l + A) if no leaf is occupied; if the leaf i is chosen, it will be occupied with probability 
Pi if the root is unoccupied. Therefore r := maXp{Tp} is defined as the worst case relaxation 
time over all the possible choice of p. Following the same block dynamics strategy [20j as 
in Section 2.3 of the paper by [3] (See also [19] and [30]), it is not hard to show that the 
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relaxation time of the above Glauber dynamics is exactly the same as that of the natural 
block dynamics which updates the configurations of a whole subtree of the root in one 
step, and hence the following lemma holds. 

Lemma 18. The relaxation time Tj-ei of the Glauber dynamics of the hard-core model on 
the complete tree of height H is upper bounded by t^ for any boundary condition on the 
leaves. 

Note that, the relaxation time is quite sensitive with respect to the boundary con- 
ditions. Especially, in the paper by [22], they show that when the boundary conditions 
are even (or odd) meaning that occupied all the leaves when the height is even (odd) 
and unoccupied all the leaves when the height is odd (even), the mixing time is actually 
O(nlnn). In this paper we are dealing with any kind of boundary condition, and in the 
lower bound part, we show the boundary conditions that slow down the Glauber dynamics 
actually exist. The lower bound on the relaxation time for the Glauber dynamics under 
that boundary conditions roughly matches up with the upper bound here. 

We need to bound the relaxation time r of the Glauber dynamics on star graphs for 
different cases with respect to p. To do this, we first use the so-called maximum one 
step coupling to bound the mixing time and then by Eq.Q we get an upper bound on 
the relaxation time r. The maximal one-step coupling, originally studied for colorings 
by Jerrum |14] gives a way to upper bound the mixing time of the Glauber dynamics 
on general graphs. The coupling {Xt,Yt) of the two chains is done by choosing the same 
random vertex v for changing the states at step t and maximizing the probability of the 
two chains choosing the same update for the state of v. Thus, if none of neighbors of v is 
occupied in neither Xt and Yt, v will be occupied/unoccupied in both chains at time t + 1 
with the correct marginal probability. In all other cases, the update choices for Xt^i(v) 
and Yt-\.i{v) are coupled arbitrarily. By analyzing the maximum one-step coupling, we will 
show a series of lemmas for different p, which give the upper bound on the relaxation time 
r. 

Lemma 19. If^pi < 4 In In 6 and for all i, Pi < 1 — l/ln5, then r = 0{b^^°^^'), where 
o(l) is a term that goes to zero as b goes to infinity. 

Lemma 20. // X] Pi > 4 In In 6, then t = 0{{X + l)b In b). 

Lemma 21. If there exists an i such that pi > 1 — l/ln6, then r = 0((A + 1)6 In 6). 

The intuition behind is that the typical behaviors of the coupling chain change with 
respect to the leaves marginal probability p. When p are all tiny (Lemma ll9p . the coupling 
chain has a large chance to have all the leaves unoccupied in both (Xt) and (Yt) and 
hence can be coupled at the state where the root is occupied in both chains with a good 
probability; While when the sum of p is big (Lemma 1201 and [2T]) suggesting that there is a 
good chance that one of the leaves is occupied always, the coupling chain is easier to get 
coupled once the root is not occupied in both chains. We delay the proofs of these lemmas 
in Section 16.11 By applying Lemma [18] to Lemma [T9l [20] and [2T] we get the conclusion 
that the relaxation time is upper bounded by (9(77,i+i°('^+i)/ iii^+"b(i)) for any A > 0. Recall 
that the relationship between uj and A is A = u;(l + w) and in this paper we are mainly 
interested in the cases when a; = (1 + 5) In 6/6 for any constant 5 > —1. Hence, in terms 
ofw, if— 1 < 6 < the relaxation time is upper bounded by n^+°b{^) ^nd if (5 > the 
relaxation time is upper bounded by n^~^ '^"bW ^ which proves Theorem [1] 
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6.1 Proofs. 

With out lose of generality, we assume that the root is unoccupied in Xq and occupied in 
Yq. Readers will see later in the proofs that these are indeed the worst cases scenarios. 
We prove the lemmas by analyzing the coupling in rounds, where each round consists of 
T := 206 In 5 steps. The following analysis says that if in each round we have a good 
probability of coalescing (i.e., achieving Xt = It), then we will have a good upper bound 
on the mixing time (and hence the relaxation time). Suppose that in the coupling process, 
starting from any pair of initial states, after time T, with probability Pt we have Xt = Yt- 
By repeatedly applying this assumption i many times we have, for all {xo,yo), 

Pr [X2^T / Y2iT I Xo = xq, Yq = Vo] < (1 " Pt?' < l/2e 

for i = 1/Pt- Therefore, by applying the Coupling Lemma, mentioned in Section O the 
mixing time is 0{b\nb/PT)- Now, the only thing left is to lower bound Pt for different p 
assuming that the root r ^ Xq and r gYq. Let us define for any configuration Xt £ ^(G), 
\\-^t\\ '■= \{i G ^(G) ■ deg{£) = 1 and i G Xt}\, the number of leaves of the star graph G 
that are in the independent set. 

Proof of Lemma \19l In this case, we will try to couple all of the leaves to unoccupied and 
then couple the root. Let Tq be the last time the root is chosen to change the state in 
the T steps running of the coupling chain. It is a simple fact from the coupon collector 
problem that with high probability, Tq > 106 In 6 and all the leaves have been chosen at 
least once before Tq. Note that throughout this section "with high probability" means 
the probability goes to 1 as 6 goes to infinity. Conditioning on this, consider the following 
process (Bt): 

• Bt C V{G); 

• For each time t the process (Bt) chooses a leaf Vi to update: -Bt+i = BtU Vi with 
probability pi and Bt+i = Bt\ {vi} with probability 1 — pi; 

• The process will ignore any update of the root and the state of the root at any time. 

By the conditioning we assume, we know that after 106 In 6 steps, every leaf has been 

updated at least once. Therefore the chance that all of the leaves are not in Bt (i.e. (Bt) 

is an empty set) is 

6 6 



n(i - Pi) > ne"'''^'"'"' > e-^l'^'''^^ 



where 1 — x > e~'^ holds when x < 1 — 1/ In 6. There is a straightforward coupling 

of (Xt) and (Bt) such that for any t > 0, \\Xt\\ < \Bt\. And also, by the natural of the 
maximum one-step couplings with the assumption r ^ Xq and r G Yq, for any t > 0, 
\\^t\\ ^ 11^ II- Therefore, we can conclude that with probability at least e~^ , at time 

To — 1, all the leaves are unoccupied in both X and Y, and hence with probability 1, when 
the coupling chain chooses the root to update at time Tq, we have Xtq = Ytq ■ In conclusion, 
we show that with probability at least e"^'"^ ^^^, we have Xt = Yt, which implies that 
the mixing time and hence the relaxation time is 0(61n6 • e'^^^ ''^'') = 0{b^^°''^^'), where 
Ofe(l) is a 0(ln In 6)/ In 6 function. D 
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Proof of Lemma [2^ In this case, we want to first unoccupied the root in both chains and 
then couple the leaves. The key observation is that once the root is not occupied, it will 
not be occupied again since there is a good chance that one of the leaves is always occupied 
and hence "blocking" the root from being occupied. First of all, it is easy to see that in the 
first b steps, the coupling chain has a positive probability of choosing the root to change 
the state and at that moment, the chance of the root being unoccupied in both chains is 
at least 1/(A + 1). We denote this as event £i. Next, we want that in the following b steps, 
with positive probability, the root will not be chosen to change the state, therefore the 
chain chooses b leaves to change the states during these b steps. Let the set of leaves that 
are chosen during these b steps be S. We want to argue that with positive probability, 
Ylies Pi -^ '^0 In In ^ for some constant cq > 1 so that S" is a set of good representatives of 
all the leaves. Let this be the event £2- By using the fact that for each leaf, the indicator 
random variable of whether the leaf is chosen or not during these b steps are negatively 
associated to each other (c.f.. Theorem 14 in |9j), we can still use the following Hoeffding 
bound (Theorem 2 in [13j) by Proposition 7 in [9]. 

Theorem 22. Let Xi, . . . ,X„ be independent (or negatively associated) random variables 
with tti < Xi <bi. Let X = Yli^i o-nd /i = E[X]. Then the following inequality holds 

Pr [\X - Ml > i] < 2exp(-^^,^- —^). 

In our case, Xi = pi if the leaf i is chosen in b steps and otherwise zero. Therefore, 
since in our case Y17=i(.'^i ~ ^«)^ ^ {J27=ii^i ~ o«))^ = (41nln6)2, fi > 4(1 — 1/e) Inlnft and 
t = 5p for some properly chosen (5 > 0, it is straightforward to show that with positive 
probability, the event £2 happens. Note that the events £1 and £2 are actually independent. 
And from now on, we are conditioning on £1 and £2- 

For each t > 26, let At := {£ €z S : £ €z Xt and i E 1^} be the set of leaves in S such 
that they are in the independent set at time t in both Xt and Yj. Let the stopping time 
Tg be the first time t > 2b such that At = 9 and the root is chosen. If we can show 
that Pr [Ts < 26 In 6] < 1/2, then with positive probability, the root is always blocked from 
changing to occupied since At ^ f/i. Hence once the chain chooses all the leaves during 
26 In 6 — 26 steps, we will have Xt = Yt- Finally, because the events £1 and £2 happen with 
probability 0(1/(A + 1)), we can conclude that with probability ri(l/(A + 1)), Xt = Yt- 

In order to show that Pr [Tg < 26 In 6] < 1/2, we will use the same coupling strategy 
as in the last proof. Let (Ct) be a stochastic process that update the states of \S\ many 
independent vertices in the following way: 

• The process starts at time t = 26 + 1 with a random starting configuration; 

• For any t, Ct C S; 

• For each time t the process (Ct) chooses a vertex v G V{G) randomly; 

• li V = Vi £ S then apply the following update rule: Ct+i = CtU Vi with probability 
Pi and Bt+i = Bt\ {vi} with probability 1 — pi. 

Let the stopping time Tc be the first time t > 26 such that Ct = ^ and the root is chosen. 
We want to couple {Ct) and {Xt,Yt) in such a way that Tg > Tc for all runs and then if 
we can show that Pr [Tc < 26 In 6] < 1/2, by the coupling, we are done. 
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In fact, it is easy to show that Pr [Tc < 26 In 6] < 1/2 (any positive constant will do) 
by using union bound combining with the following two facts: for any fix time t > 26, the 
probability that Cj = is upper bounded by (Inft)"*^" with cq > 1; with high probability 
the root will be chosen at most 0(ln6) times during T steps. Now we just need a valid 
coupling. 

We couple (Ct) with the coupling chain {Xt,Yt) from time t = 26 + 1 in the following 
way: 

• Initially, for t = 2b, v £ Ct if and only if t; £ At- 

• For any time t, both of them pick the same vertex v to update and ii v £ S, v G Ct 
if and only if v G At- 

It is easy to verify that conditioning on the events £i and £2, this coupling is valid and 
Ts> Tc, which completes the proof. D 

Proof of Lemma [¥I[ This case is the same as the last one. However, we can just use the 
leaf i to block the root since it has a very high probability of being occupied, i.e., for the 
probability bound concerning Tg, it is enough to bound the probability that at a given 
time t, the leaf i is occupied. D 
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